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Abstract 

Background: Sox6 is a multi-faceted transcription factor involved in the terminal differentiation of many different 
cell types in vertebrates. It has been suggested that in mice as well as in zebrafish Sox6 plays a role in the terminal 
differentiation of skeletal muscle by suppressing transcription of slow fiber specific genes. In order to understand 
how Sox6 coordinately regulates the transcription of multiple fiber type specific genes during muscle 
development, we have performed ChlP-seq analyses to identify Sox6 target genes in mouse fetal myotubes and 
generated muscle-specific Sox6 knockout (KO) mice to determine the Sox6 null muscle phenotype in adult mice. 

Results: We have identified 1,066 Sox6 binding sites using mouse fetal myotubes. The Sox6 binding sites were 
found to be associated with slow fiber-specific, cardiac, and embryonic isoform genes that are expressed in the 
sarcomere as well as transcription factor genes known to play roles in muscle development. The concurrently 
performed RNA polymerase II (Pol II) ChlP-seq analysis revealed that 84% of the Sox6 peak-associated genes 
exhibited little to no binding of Pol II, suggesting that the majority of the Sox6 target genes are transcriptionally 
inactive. These results indicate that Sox6 directly regulates terminal differentiation of muscle by affecting the 
expression of sarcomere protein genes as well as indirectly through influencing the expression of transcription 
factors relevant to muscle development. Gene expression profiling of Sox6 KO skeletal and cardiac muscle revealed 
a significant increase in the expression of the genes associated with Sox6 binding. In the absence of the Sox6 
gene, there was dramatic upregulation of slow fiber-specific, cardiac, and embryonic isoform gene expression in 
Sox6 KO skeletal muscle and fetal isoform gene expression in Sox6 KO cardiac muscle, thus confirming the role 
Sox6 plays as a transcriptional suppressor in muscle development. 

Conclusions: Our present data indicate that during development, Sox6 functions as a transcriptional suppressor of 
fiber type-specific and developmental isoform genes to promote functional specification of muscle which is critical 
for optimum muscle performance and health. 



Background 

Skeletal muscle in vertebrates has evolved to be a major 
organ system with great adaptability in order to respond 
to constantly changing physical demands placed upon it. 
This adaptability is achieved by the ability of muscle 
fibers to change their contractile and metabolic proper- 
ties. Adult skeletal muscle consists of two major fiber 
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groups, slow-twitch and fast-twitch. In general, slow 
fibers are best fit for long-lasting aerobic activity 
whereas fast fibers are best fit for short bouts of anaero- 
bic activity [1]. At the molecular level, a coordinated 
expression of multiple fiber type-specific genes, both 
structural and enzymatic, is required to give each fiber 
type its unique characteristics. Slow and fast muscle 
fibers are operationally defined by the expression of the 
isoforms of myosin heavy chain (MyHC) [2]. In adult 
rodent skeletal muscle, slow fibers are defined by the 
expression of MyHC-p, whereas fast fibers are defined 
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by the expression of three MyHC isoforms, Ha, Ilx/d, 
and lib (contractive speed: IIa<IIx/d<IIb) [3]. In devel- 
oping fetal rodent muscle, instead of MyHC-IIa, Ilx/d, 
and lib, there are two developmental MyHC isoforms 
(embryonic and perinatal) that are expressed, along with 
MyHC-p, at different stages of development [4,5]. After 
birth, expression of embryonic and perinatal MyHC iso- 
forms as well as MyHC-P is significantly downregulated 
and the majority of the rodent muscle becomes fast 
MyHC-expressing fibers with exception of weight bear- 
ing core muscles such as soleus where slow MyHC-P is 
highly expressed [5-7]. In adult muscle, the main deter- 
minant of muscle fiber type is motoneuron input 
[3,8-11]. Several mediators and transcription factors 
have been identified for the nerve dependent fiber type 
regulation in adult skeletal muscle [3]. In contrast, our 
knowledge about factors that regulate fiber type differ- 
entiation during skeletal muscle development is still lim- 
ited. We have previously reported that Sox6 mutant fetal 
and perinatal skeletal muscle exhibits a significant 
increase in slow fiber type-specific gene expression 
accompanied by a significant decrease in fast fiber type- 
specific gene expression [12,13]. Based on these observa- 
tions, we have proposed that Sox6 functions as a tran- 
scriptional suppressor of slow fiber specific genes in 
developing skeletal muscle. 

Sox6 is a member of the evolutionarily highly con- 
served Sox transcription factor family [14-17]. Between 
mice and humans, the overall amino acid sequence of 
the Sox6 protein is approximately 95% conserved, and 
the functional domains are 100% conserved [18]. The 
Sox proteins contain the Sry-related HMG box domain 
which mediates sequence-specific DNA binding [16,17]. 
In general, the specificity of Sox protein targets in each 
cell type is regulated by their cofactors [16,19], a prop- 
erty that is especially important for the Sox6 protein 
since it lacks a regulatory domain (activator or repres- 
sor). Therefore, when Sox6 is involved in transcriptional 
regulation, cofactors of Sox6 dictate whether the out- 
come is activation or repression [15,16]. For example, 
Sox6 activates cartilage specific gene transcription as 
part of the Sox trio proteins (Sox5, Sox6 and Sox9) 
[20-22]. In other cell types, Sox6 suppresses transcrip- 
tion of the fgf3 gene or the cyclinDl gene by associating 
with repressors [23,24]. In the case of skeletal muscle, 
we have shown that Sox6 suppresses transcription of 
slow fiber specific genes during development, thus play- 
ing a critical role in initial muscle fiber type differentia- 
tion [12,13]. 

In the present study, to start to uncover how Sox6 
regulates transcription of fiber type specific genes at the 
molecular level, we used a conditional Sox6 allele [25] 
to inactivate Sox6 in developing skeletal muscle. The 
muscle specific inactivation of Sox6 allowed us to 



overcome the perinatal lethality of Sox6 mutant mice 
[26,27] and obtain Sox6 knockout (KO) adult skeletal 
muscle for in-depth analysis. To identify Sox6 target 
genes and assess their transcriptional status, we con- 
ducted ChlP-seq analyses using Sox6 and RNA polymer- 
ase II (Pol II) antibodies. Combining these methods, we 
demonstrate that: (1) Inactivation of Sox6 results in an 
extreme upregulation in expression of slow fiber speci- 
fic, cardiac and fetal isoform genes, suggesting that Sox6 
is required for the functional maturation of skeletal 
muscle, and (2) Sox6 binds to the DNA sequences in 
the vicinity of these genes, and thus is directly involved 
in the transcriptional suppression of its target genes. 
These results indicate that Sox6 plays a critical role in 
functional specification of muscle during development. 

Results 

The expression level of MyHC-p is dramatically increased 
in Sox6 KO muscle during development 

We have previously shown that in the Sox6 null fetal 
skeletal muscle, nascent fast muscles maintain slow 
MyHC-P expression [13]. In addition to MyHC-p, other 
slow fiber specific genes (e.g. Tnncl, Tnnil, Tnntl, and 
My 12) are also upregulated in the Sox6 null muscle, 
along with significant downregulation of multiple fast 
fiber specific genes [12,13]. Based on these results, we 
proposed that Sox6 functions as a suppressor of slow 
fiber specific genes, thus the loss of Sox6 leads to an 
increase in slow muscle fibers. Since Sox6 null muta- 
tions cause early postnatal lethality [26,27], we were 
unable to determine whether this Sox6 null fetal pheno- 
type is maintained through postnatal development. To 
overcome the lethal phenotype, we utilized mice carry- 
ing a Sox6 conditional allele [25] to inactivate Sox6 spe- 
cifically in skeletal muscle. To start assessing the 
phenotype of adult Sox6 KO muscle, we first used the 
Myf5-Cre mouse [28]. In this Cre-transgenic mouse, the 
Cre recombinase under the control of the Myf5 promo- 
ter is expressed very early in the skeletal muscle lineage 
(starting at approximately E8 in somites); therefore, the 
inactivation of Sox6 occurs significantly earlier than the 
beginning of fiber type specification [4,5]. To conduct a 
comprehensive analysis of the Sox6 KO muscle pheno- 
type, we examined four different muscles in the hin- 
dlimb, the tibialis anterior (TA, fast), extensor digitorum 
longus (EDL, fast), gastrocnemius (fast), and soleus 
(slow) [6]. The mRNA expression of the following four 
genes: slow MyHC-P (MyhT), fast MyHC-IIb (Myh4), 
peroxisome proliferative activated receptor y coactivator 
la (Ppargcla), and succinate dehydrogenase complex 
subunit A (Sdha) were determined by reverse transcrip- 
tion-quantitative PCR (RT-qPCR) and compared 
between Sox6 KO (Sox6 f/f ; Myf5-Cre) and control 
(Sox6 f/f ) mice. As summarized in Table 1, Sox6 
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Table 1 Fold change in mRNA levels in the Sox6 KO skeletal muscles compared to control 



TA EDL Gas Sol 



Mouse age (month) 


2 


3 


2 


3 


2 


3 


2 


3 


Sox6 


0.51 


0.10 


0.94 


0.01 


0.12 


0.20 


0.36 


0.48 


Myh7 (MyHC-|3) 


779.78 


147.72 


22.36 


4186.18 


5.75 


11.24 


1.80 


2.71 


Myh4 (MyHC-llb) 


0.03 


3 x 10" 3 


0.20 


3 x 10" 3 


0.12 


2 x 10~ 3 


0.88 


0.30 


Ppargda (PGCI^x) 


3.27 


0.35 


2.06 


0.13 


0.22 


0.61 


0.85 


1.92 


Sdha 


1.02 


0.38 


1.93 


0.68 


0.97 


0.51 


1.00 


1.27 



Sox6 was inactivated using Myf5-Cre mice. A two month-old and a three month-old mice were examined. Control expression level = 1.00. 



inactivation caused a significant increase in the mRNA 
expression of Myh7 and a concurrent decrease in Myh4 
in the TA, EDL, and gastrocnemius muscles. The Sox6 
KO soleus muscle showed the least change in expression 
of these two MyHC isoforms (Table 1). This result likely 
reflects the observation that Sox6 expression in soleus is 
significantly lower than the other three fast muscles 
(Additional file 1, Figure SI A), therefore, Sox6 inactiva- 
tion may have had a less impact in soleus compared to 
the other muscles. Regarding the Sox6 inactivation levels 
in adult muscle, we noticed that a higher level of Sox6 
inactivation, determined by Sox6 mRNA level, did not 
necessarily correlate with an increase in Myh7 mRNA 
level. There are a few possible hypotheses to explain 
this observation. First, Sox6 is not a muscle specific 
gene and is also expressed in fibroblasts, which can 
obscure an accurate quantification of Sox6 mRNA speci- 
fic to muscle cells. Second, the Sox6 mutation is reces- 
sive in nature. Therefore, although two independent 
Sox6 KO muscle samples show 50% reduction in Sox6 
mRNA level, one sample may have more homozygous 
Sox6 null cells and the other may have more heterozy- 
gous cells, leading to a significant difference in Myh7 
expression. Third, skeletal muscle is multinucleated, 
which adds another layer of complexity as to how Sox6 
inactivation in each nucleus influences Myh7 expression 
in a myotube as a whole. 

To sort out these issues, we performed immunohisto- 
chemistry to examine the Sox6 and Myh7 (MyHC-P) 
protein expression at the cellular level in fetal, early post- 
natal and adult muscle. We focused our observation on 
the TA-EDL region, composed of fast-twitch myofibers 
in the adult mouse. As shown in Figure 1A, in E18.5 con- 
trol (Sox6 f/f ), nuclear Sox6 staining was well correlated 
with the absence of cytoplasmic MyHC-P staining. Also 
at P7, the presence of Sox6 nuclear staining corre- 
sponded to MyHC-P negative myotubes (Figure IB). In 
Sox6 KO muscle (Sox6 f/f ; Myf5-Cre), at both stages, 
nearly 100% of myofibers displayed MyHC-P expression 
(Figures 1A and IB). These data show that Sox6 expres- 
sion does not coincide with slow-twitch fiber gene 
expression, supporting our idea that Sox6 functions as a 



suppressor of the slow-twitch fiber gene program. There- 
fore, at the protein level, the loss of Sox6 expression 
clearly leads to upregulation of MyHC-P during the early 
stages of muscle development. During the normal mouse 
fast muscle development, the number of MyHC-P posi- 
tive slow-twitch fibers significantly decreases as postnatal 
skeletal muscles functionally mature [3]. We observed 
this trend in the developing control mouse muscles (Fig- 
ures 1A and IB), resulting in adult TA-EDL muscle with 
extremely rare MyHC-P positive myofibers (Figure 1C). 
In contrast to the control, at E18.5 and P7, nearly all 
Sox6 KO myofibers were MyHC-P positive (Figures 1A 
and IB), indicating that at these early stages, muscle-spe- 
cific Sox6 inactivation led to extensive upregulation of 
MyHC-P expression in the entire Sox6 KO muscle. In 
the adult Sox6 KO muscle, on the other hand, approxi- 
mately 50% of myofibers were MyHC-P positive, a signifi- 
cant increase compared to the control (Figure 1C, Sox6 f/ 
f ); however a significant decrease compared to the P7 
Sox6 KO muscle (Figure IB, Sox6 f/f ; Myf5-Cre). When 
Sox6 staining signals in the control and Sox6 KO adult 
muscles were compared, overall Sox6 staining signals 
were lower in Sox6 KO, however, it was hard to make a 
clear correlation with MyHC-P staining, since Sox6 stain- 
ing in adult muscle was quite diffused (Figure 1C, Sox6 f/ 
f ). In light of this, we noticed that in control P7 muscle, 
some MyHC-P -negative myofibers did not show nuclear 
Sox6 staining, but rather dispersed cytoplasmic Sox6 
staining (Figure IB, marked with * in Sox6 f/f ). This obser- 
vation may suggest an unknown additional mechanism to 
relocate the Sox6 protein from the nucleus and/or 
degrade it in differentiated, more mature myotubes. A 
recent report on Sixl/Six4 double KO muscle suggests 
that these two proteins positively regulate fast-twitch 
fiber differentiation and may also influence Sox6 nuclear 
localization during fetal muscle development (E18.5) 
[29] . In adult muscle, therefore, not only Sox6 expression, 
but other mechanisms such as the Sixl/Six4 regulated 
Sox6 shuttling may be in place to finalize fiber type gene 
expression in response to the environmental cues. 

In addition to the muscle structural protein genes, we 
also examined mRNA expression of the genes playing a 
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E18.5 

DAPI MyHC-P DAPI MyHC-P 



B 



Sox6 Merge 
Sox6 f/f 



DAPI 



Sox6 Merge 
Sox6 f/f ; Myf5-Cre 



P7 



MyHC-P 



DAPI 



MyHC-p 












Sox6 Merge 
Sox6 f/f 



Sox6 Merge 
Sox6 f/f ; Myf5-Cre 



MyHC-p 



4 month old 

DAPI MyHC-P 



MyHC-p 




Sox6 Merge Sox6 Merge Sox6 Merge 

Sox6 f/f Sox6 f/f ; Myf5-Cre Secondary only 

Figure 1 The number of fibers expressing MyHC-p is dramatically increased in Sox6 KO skeletal muscle. A. Cross-sections of El 8.5 lower 
hindlimb muscle (TA-EDL region) from control (Sox6 f/f ) and Sox6KO (Sox6 f/f ; Myf5-Cre) were stained with DAPI (blue) or specific antibodies for 
MyHC-p (green) or Sox6 (red). x400 magnification. B. P7 (7 day old) hindlimb muscle processed for DAPI, Sox6 and MyHC-p immune-staining* 
indicates myotubes that are negative for both MyHC-p and Sox6 staining in control muscle (see text for discussion). x400 magnification. C. Four 
month old adult TA-EDL muscle. A control muscle section stained with DAPI or secondary antibodies (a mixture of both anti-mouse and anti- 
rabbit) only was also shown. x200 magnification. 
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role in muscle metabolism, Ppargcla (PGC-la) and 
Sdha in adult muscle. Ppargcal is a co-regulator of 
mitochondrial biogenesis and oxidative phosphorylation 
[30,31] and Sdha is a component of TCA cycle and 
complex II of the mitochondrial respiratory chain, 
whose expression is activated by Ppargcla [30]. We 
speculated that Ppargcal and Sdha mRNA expression 
would also be upregulated in Sox6 KO muscle, because 
of a correlation between oxidative metabolism and slow 
fiber content reported in adult skeletal muscle [1,32,33]. 
In spite of this expectation, neither Ppargcla nor Sdha 
mRNA showed a noticeable increase in the Sox6 KO 
muscles (Table 1). This lack of correlation of the two 
gene programs was also observed in Sox6 KO muscles 
generated using MCK-Cre transgenic mice (Table 2, dis- 
cussed later in the text). In a recent report on the adult 
Sox6 KO muscle phenotype, Quiat et al. also reported 
that expression of Ppargcla was not changed [34]. 
Therefore, these results suggest that Sox6 plays a role in 
transcriptional regulation of the structural protein genes 
which define muscle fiber types, but not of the genes 



which define the metabolic state of skeletal muscle. In 
order to uncover the mechanisms of muscle differentia- 
tion that are regulated by Sox6 at the molecular level, 
we next performed Sox6 ChlP-seq analysis. 

Genome-wide Sox6 binding in skeletal myotubes 

To identify genome-wide binding of Sox6 in mouse ske- 
letal muscle, we performed ChlP-seq analysis. As the 
chromatin source, we chose wild type fetal (E.18.5) myo- 
tubes differentiated for 48 hours in vitro, because at this 
time point, a significant differential expression of slow 
fiber specific genes was observed between Sox6 null and 
wild type myotubes [13], suggesting an ideal time point 
to capture Sox6 acting as a transcriptional suppressor of 
those genes. Also, since Sox6 is highly expressed in 
fibroblasts (unpublished data), using a pure muscle cell 
population was necessary to identify muscle-specific 
Sox6 binding. We conducted two independent ChlP-seq 
experiments and obtained 3 and 1.5 million reads unam- 
biguously mapped to the mouse genome for each 
experiment (out of -20 million total reads). As a result, 



Table 2 Fold change in mRNA levels in the Sox6 KO skeletal muscles compared to control 



Mouse ID# 




TA 






EDL 






Gas 






Sol 




1 


2 


3 


1 


2 


3 


1 


2 


3 


1 


2 


3 


Sox6 


0.75 


0.42 


0.09 


0.43 


0.38 


0.07 


0.38 


0.53 


0.10 


0.20 


0.31 


0.12 


Myhl (llx/d) 


2.08 


1.68 


0.20 


1.31 


5.77 


5.49 


6.79 


14.80 


2.11 


0.01 


0.03 


1 x 10~ 3 


Myh2 (Ma) 


5.00 


1.96 


0.80 


3.08 


2.11 


4.72 


6.63 


5.77 


1.85 


0.02 


0.02 


u.d. 


Myh4 (lib) 


0.17 


0.01 


0.01 


0.11 


0.04 


0.02 


0.25 


1 x 10~ 3 


0.02 


0.16 


0.16 


0.01 


Myh6 (a) 


78.79 


252.99 


1.03 


24.19 


25.37 


6.65 


3.54 


4.13 


2.42 


4.03 


1.61 


0.65 


Myh7 ((3) 


9042.52 


2177.81 


94.12 


1318.30 


728.63 


1611.29 


21.86 


3.59 


9.68 


3.60 


1.00 


1.20 


Myh7b 


31.37 


n.d. 


0.49 


3.98 


n.d. 


1.39 


14.30 


n.d. 


3.06 


1.51 


n.d. 


0.34 


Myl2 


1496.57 


n.d. 


7.07 


57.22 


n.d. 


55.59 


7.37 


n.d. 


2.70 


1.91 


n.d. 


1.31 


Tnncl 


6830.19 


4622.47 


474.50 


665.43 


687.83 


128.70 


91.24 


167.58 


14.52 


0.81 


2.98 


1.53 


Tnnil 


2619.20 


1867.14 


610.53 


433.38 


552.33 


861.06 


116.58 


69.76 


24.71 


2.64 


2.58 


2.59 


Tnni2 


0.34 


0.50 


0.20 


0.36 


0.89 


0.15 


0.46 


0.76 


0.35 


0.01 


0.14 


2 x 10~ 3 


Tnntl 


4348.43 


1594.44 


157.77 


1011.39 


510.51 


153.18 


131.98 


59.18 


5.07 


2.93 


1.76 


1.56 


Tnnt2 


2.45 


1.88 


2.08 


2.31 


0.59 


1.55 


1.23 


2.65 


1.37 


2.59 


1.12 


1.85 


Tnnt3 


0.45 


0.40 


0.43 


0.28 


0.89 


0.58 


0.34 


0.38 


0.19 


4 x 10~ 3 


0.06 


2 x 10~ 4 


Chrng (fetal) 


0.89 


4.63 


4.94 


0.73 


1.80 


1.82 


1.23 


1.80 


3.29 


2.78 


1.17 


8.26 


Chrne (adult) 


0.44 


0.30 


0.28 


0.14 


0.18 


0.76 


0.16 


0.15 


0.14 


0.24 


0.34 


1.44 


Proxl 


29.26 


10.30 


9.66 


22.84 


6.68 


17.04 


16.21 


8.41 


3.34 


3.79 


2.26 


1.27 


Teadl 


1.83 


n.d. 


0.99 


0.63 


n.d. 


0.98 


1.25 


n.d. 


0.78 


1.02 


n.d. 


0.94 


Tead4 


3.03 


n.d. 


1.26 


1.34 


n.d. 


1.08 


1.23 


n.d. 


1.08 


0.96 


n.d. 


1.11 


Tcf4 


1.94 


n.d. 


1.70 


0.88 


n.d. 


1.54 


1.68 


n.d. 


1.15 


0.88 


n.d. 


1.43 


Hdac9 


4.44 


n.d. 


2.68 


1.51 


n.d. 


1.79 


7.56 


n.d. 


1.42 


1.34 


n.d. 


0.75 


Myodl 


0.73 


1.01 


0.70 


0.73 


1.02 


2.11 


1.47 


1.44 


1.17 


1.51 


1.77 


0.72 


Myog 


3.64 


1.52 


2.24 


1.98 


1.84 


3.00 


2.61 


1.12 


0.95 


4.48 


1.89 


1.35 


Mb 


1.84 


1.47 


1.48 


0.66 


2.30 


1.75 


1.88 


3.50 


1.54 


0.77 


1.62 


1.09 


Sdha 


1.18 


0.74 


0.39 


0.70 


0.74 


0.50 


0.79 


0.60 


0.32 


0.68 


0.59 


0.55 


Ppargcla 


1.28 


1.39 


0.53 


0.82 


0.78 


0.35 


0.63 


1.46 


0.42 


0.53 


0.71 


0.19 



Sox6 was inactivated using MCK-Cre mice. Mouse 1 and mouse 2 are two month-old, and mouse 3 is three month-old. Gene names in bold: associated with Sox6 
binding. Gene names in italics: slow fiber specific sarcomere proteins, or transcription factors reported that are preferentially expressed in slow fibers, n.d.: not 
determined, u.d.: undetected in Sox6 KO mouse. Control expression level = 1.00. 
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we identified 1,066 Sox6 peaks common to the two 
ChlP-seq data sets. These peaks were assigned to a total 
of 867 mouse RefSeq genes. The vast majority of the 
Sox6 binding sites were located in intronic regions 
(48.4%), followed by intergenic regions (more than 20 
kb away from transcription start site (TSS) or transcript 
end) (29.2%) and 5'-upstream region (within 20 kb of 
TSS including promoter) (13.6%) (Figure 2). 

To determine whether any known transcription factor 
consensus sequences are over-represented within the 
Sox6 peak regions, a motif search was performed. Motif 
analysis using MEME (Multiple EM for Motif Elicita- 
tion) [35] identified four known transcription factor 
consensus motifs in the Sox6 peaks (Figure 3). When 
the occurrence of a single motif was set to 0 or 1 per 
peak, 723 Sox motifs (P < 10" 4 ) and 636 E-box motifs (P 
< 10" 4 ) were identified. The fact that the Sox consensus 
motifs were found in the overwhelming majority of the 
Sox6 peaks (723 out of 1,066) suggests that the Sox6 
binding sites identified here are bona fide Sox6 targets. 
The E-box motifs (CAG[C/G]TG) identified using the in 
silico method here were identical to the E-box motifs 
which were enriched in MyoD binding sites detected 
using C2C12 myotubes [36]. Comparing our data with 
the MyoD ChlP-seq data obtained from adult mouse 
primary myotubes [36] revealed that 96% of the Sox6 
peaks were localized within 50 bp of the MyoD peaks 
(data not shown). 

In addition to Sox motif and E-box, Runx and Tead/ 
MCAT motifs were also found in the Sox6 peaks. When 
the occurrence of a single motif was set to 1, we identi- 
fied 559 Runx motifs (P < 10" 4 ) and 203 Tead/MCAT 



E-box 



Runx 




(Njo<<fintor^coo>o 




Sox 



Tead/MCAT 




cMcoTtm<or«-ooo)Oi-c«j 




cm co *t in «o f- 



Figure 3 Transcription factor consensus motifs found in Sox6 
binding peaks. 



motifs (P < 10" 4 ). A recent report has shown that Runxl 
has a role in skeletal muscle terminal differentiation 
[37]; therefore, Runx transcription factors might be 
involved in muscle specific gene expression together 
with Sox6. Tead/MCAT elements are known to play an 
important role in transcriptional regulation of many ske- 
letal and cardiac muscle-specific genes [38]. A signifi- 
cant presence of Tead/MCAT motifs in the Sox6 peaks, 
therefore, implies possible interactions between Sox6 
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and the Tead transcription factors during muscle 
differentiation. 

Transcriptional status of the genes associated with Sox6 
binding sites 

In order to determine the transcriptional status of Sox6 
peak-associated genes in differentiating fetal myotubes, 
we performed ChlP-seq analysis using an antibody 
recognizing a phosphorylated form of Pol II, which is 
considered to be a transcriptionally active form and asso- 
ciated with highly transcribed genes [39] . To quantify Pol 
II binding levels of RefSeq genes associated with Sox6 
binding sites, Pol II binding events in the corresponding 
gene regions were measured in RPKM (reads per kilobase 
of gene region per million reads), a unit used to quantify 
transcriptional levels in RNA-seq analysis [40]. RPKM 
was calculated from read (tag) numbers in peak regions, 
length of RefSeq gene regions, and total number of 
uniquely mapped reads (details in Methods). By this 
method, the Pol II binding level of the P-actin gene, an 
abundantly expressed housekeeping gene, was calculated 
as 8.60 RPKM. Figure 4 summarizes the fold enrichment 
of the Sox6 peaks and the corresponding Pol II binding 
of the 867 RefSeq genes associated with Sox6 peaks. We 
found that the majority of the Sox6 binding site(s)-asso- 
ciated genes were inferred to be transcriptionally inactive 
(zero to a very low level of Pol II binding). As shown in 
Figure 4 andAdditional file 2, Table SI, of the 867 genes 



250 - 



associated with Sox6 binding sites, 442 genes (51%) 
showed no Pol II binding (0 RPKM) and 289 genes (33%) 
showed less than one tenth of the Pol II binding to the P- 
actin gene (<0.86 RPKM), thus 84% of the genes asso- 
ciated with Sox6 binding sites are considered to be tran- 
scriptionally inactive or transcribed at a very low level in 
myotubes. These data strongly suggest that the binding 
of Sox6 to its targets mostly results in transcriptional 
suppression. The rest of the Sox6 peak associated genes 
were transcribed mostly at a range of low to moderate 
levels (less than half of the Pol II binding to the P-actin 
gene). There were, however, a small number of Sox6 
peak-associated genes that exhibited a high level of Pol II 
binding. For example, Myl4 (embryonic MyLC isoform), 
Tnncl, and Myh3 (embryonic MyHC isoform) showed a 
relatively high level of Pol II binding (>4.30 RPKM). In 
the case of Tnncl, one of the two Sox6 peaks was identi- 
fied in the first intron (Additional file 3, Figure S2D), 
where a muscle enhancer element was reported [41]. 
Therefore, an unidentified enhancer element may exist in 
the vicinity of the Sox6 binding sites in Myl4 and Myh3. 

Functional characterization of the genes associated with 
Sox6 binding sites 

Gene Ontology (GO) analysis revealed that the Sox6 
peak-associated genes showed the highest enrichment 
for the GO categories relevant to muscle cytoskeleton 
and myofibril establishment (Table 3). Many of these 
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867 genes ranked by Pol II binding and chromosomal location 

Figure 4 Comparison of Sox6 binding and Pol II binding to the Sox6 target genes. Left Y axis shows fold enrichment of Sox6 obtained by 
the peak calling program SISSRs using the 3 million read data set (see the text for detail), and right Y axis shows Pol II binding levels to the 
Sox6 peak-associated genes measured in RPKM. X axis shows all Sox6-associated genes (867 RefSeq genes in total) sorted according to Pol II 
binding and chromosomal location. When multiple Sox6 peaks were associated with one gene, only the peak with the highest fold enrichment 
was used. Note that Pol II binding to p-actin, an abundantly expressed housekeeping gene, was 8.60 RPKM. A similar result was obtained using 
the 1.5 million read-data set (data not shown). 
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Table 3 Biological processes enriched among genes associated with Sox6 peaks 


GO term 


P value 


GO:0015629~actin cytoskeleton 


2.58E-013 


GO:0008092~cytoskeletal protein binding 


4.27E-013 


GO:0005856~cytoskeleton 


4.51 E-012 


GO:0003779~actin binding 


2.93E-01 1 


GO:003001 6~myofibril 


3.88E-008 


GO:0043292~contractile fiber 


7.80E-008 


GO:0044430~cytoskeletal part 


1 .67E-007 


GO:0030054~cell junction 


6.40E-007 


GO:0045944~positive regulation of transcription from RNA polymerase II promoter 


1 .40E-006 


GO:0045893~positive regulation of transcription, DNA-dependent 


2.93E-006 


GO:0051 254~positive regulation of RNA metabolic process 


3.50E-006 


GO;00075 1 7~muscle organ development 


5.48E-006 


GO:003001 7~sarcomere 


5.82E-006 


GO:0043228~non-membrane-bounded organelle 


6.45E-006 


GO:0043232~intracellular non-membrane-bounded organelle 


6.45E-006 


GO:0006357~regulation of transcription from RNA polymerase II promoter 


7.81E-006 


GO:000701 0~cytoskeleton organization 


8.98E-006 


GO:001 0628~positive regulation of gene expression 


1.05E-005 


GO:0045941 ~positive regulation of transcription 


1.25E-005 


GO:0045935~positive regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process 


1.37E-005 


GO:0044449~contractile fiber part 


1 .39E-005 


GO:0010557~positive regulation of macromolecule biosynthetic process 


1.59E-005 


GO:0060537~muscle tissue development 


1.77E-005 


GO:0031328~positive regulation of cellular biosynthetic process 


2.01E-005 


GO:0009891 -positive regulation of biosynthetic process 


2.50E-005 


GO:0007507~heart development 


2.69E-005 


GO:0048729~tissue morphogenesis 


2.78E-005 


GO:0051 173~positive regulation of nitrogen compound metabolic process 


2.88E-005 


GO:0042692~muscle cell differentiation 


3.41E-005 


GO:0006936~muscle contraction 


3.79E-005 



The thirty most enriched Gene Ontology (GO) biological process terms are listed. 



genes encode muscle sarcomeric proteins which define 
fiber types, cardiac isoforms, and developmental iso- 
forms in muscle. For instance, Myhl (fast MyHC-IIx/d), 
Myh2 (fast MyHC-IIa), Myh6 (cardiac isoform, MyHC- 
a), Myh7 (slow MyHC-P), Myh7b (myosin, heavy chain 
7B, cardiac muscle, beta), Tnncl (troponin C, cardiac/ 
slow skeletal), and Tnnil (troponin I, skeletal, slow 1) 
were represented. The profiles of Sox6 binding and Pol 
II binding for these genes are summarized in Additional 
file 3, Figure S2A-E. Except for Tnncl (4.72 RPKM) and 
Tnnil (3. 52 RPKM), Pol II binding levels of these genes 
were very low (Additional file 3, Figure S2A-E). It 
should be noted that Sox6 peaks were not detected for 
Myh4 (Additional file 3, Figure S2A) which encodes the 
fastest adult myosin isoform MyHC-IIb [2,3] nor for 
Myh8 (data not shown) which encodes the perinatal fast 
MyHC isoform [5]. This suggests that Sox6 is not 
directly involved in transcriptional regulation of the 



fastest MyHC isoforms expressed in fetal or adult skele- 
tal muscle. 

Another noticeable GO term category enriched in the 
genes associated with Sox6 peaks involved regulation of 
transcription (Table 3). For instance, Sox6 peaks were 
found in the vicinity or in the gene region of transcriptional 
regulators including (but not limited to) Proxl, Sox6, 
Teadl, Tead4, Tcf4, Hdac9 ) Hdacll, and Nfatc3 (Addi- 
tional file 3, Figure S2F-M). These genes (except for 
Hdacll) are known to play a role in not only skeletal mus- 
cle development, but also heart development 
[12,13,38,42-49]. In spite of its high expression in skeletal 
muscle, the role of the class IV histone deacetylase Hdacll 
[50] in muscle development is yet to be discovered [51]. 

Proxl encodes a transcription factor expressed in slow 
muscle in zebrafish [47]. Though its role in mammalian 
skeletal muscle development is yet to be reported, we 
hypothesize that the Proxl protein also plays a role in 
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slow muscle fiber differentiation in mice. To support 
this, we have found that Proxl mRNA is preferentially 
expressed in the slow soleus muscle compared to the 
EDL, TA, and gastrocnemius muscles in adult (Addi- 
tional file 1, Figure SIB). Therefore, the Proxl protein 
may play a role in slow fiber differentiation during mus- 
cle development as well as maintenance of slow muscle 
in adult. In the Sox6 gene region, two Sox6 peaks were 
detected in the fifth intron (Additional file 3, Figure 
S2G). Existence of Sox6 binding sites and very low levels 
of Pol II binding in the Sox6 gene region may suggest a 
self-regulatory mechanism of Sox6 transcription during 
skeletal muscle development, as has been recently 
reported for erythrocyte development [52]. 

We also examined whether Teadi, Tead4, Tcf4, 
Hdac9 and Hdacll are differentially expressed between 
slow and fast muscles. We found that Teadi, Tead4, 
Tcf4 and Hdac9 were all expressed higher in the slow 
soleus muscle than the group of fast muscles, EDL, TA, 
and gastrocnemius (Additional file 1, Figure S1C-F). 
Hdacll, on the other hand, was expressed slightly 
higher in the fast muscles than soleus (Additional file 1, 
Figure S1G). These results suggest that Teadi, Tead4, 
Tcf4 and Hdac9 may also positively regulate slow fiber 
specific genes. The association of Sox6 peaks to these 
transcriptional regulatory genes suggests that Sox6 may 
be indirectly regulating muscle development through 
these key transcription regulators. 



Sox6 binding to the genes described above was vali- 
dated by ChlP-qPCR (Additional file 4, Figure S3). 

Muscle specific inactivation of Sox6 results in significant 
upregulation of slow fiber, cardiac, and developmental 
isoform genes in skeletal muscle 

The observation that the majority (84%) of the genes 
associated with Sox6 binding sites show little or no Pol 
II binding (Figure 4) supports our hypothesis that a 
major function of Sox6 during myogenesis is transcrip- 
tional suppression. To further evaluate this hypothesis, 
we next analyzed mRNA expression of selected genes 
associated with Sox6 binding in Sox6 KO muscle. For 
this, we used MCK-Cre mice (harboring the Cre gene 
under the control of the muscle creatin kinase promo- 
ter) to assess the effect of Sox6 inactivation in skeletal 
muscle as well as in cardiac muscle [53,54], since many 
of the putative Sox6 target genes are also expressed in 
the heart. 

First, mRNA levels of the eighteen genes (eight fiber 
type-specific genes, two cardiac isoform genes, one 
developmental isoform gene and five transcription fac- 
tors and two histone modification enzyme genes) were 
compared between control and Sox6 KO mice using 
newborn skeletal muscle (Figure 5). Sixteen genes out of 
the eighteen tested showed a significant increase in 
mRNA expression in the newborn Sox6 KO skeletal 
muscle (Figure 5). Nfatc3 and Hdacll showed a 
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Myh1 Myh2 Myh6 Myh7 Myh7b Myl2 Myl4 Tnnrt Tnnil Tnntl Tnnt2 Proxl Tc14 Teadi Tead4 Nfatc3 Hdac9Hdac11 

Figure 5 Differences of expression levels of Sox6 target genes between control and Sox6 KO perinatal mice RT-qPCR was performed 
using total RNA from skeletal muscle of control (Sox6 f/f ) and Sox6 KO (Sox6 f/f ; MCK-Cre) newborn (postnatal day 1) mice. Expression levels in 
Sox6 KO mice were divided by those in control mice, and represented as mean ± SD (n = 3). The broken line corresponds to the expression 
ratio of 1, indicating equal expression level between the KO and control mice. (*) P < 0.05; (**) P < 0.005. Fiber type specific genes: Myhl, Myh2, 
Myh7, My7b, Myl2 (also expressed in the heart), Tnncl (also expressed in the heart), Tnnil, Tnntl; cardiac isoforms: Myh6, Tnnt2; developmental 
isoform: My/4; transcription factors: Proxl, Tcf4, Teadi, Teod4, Nfatc3; histone modification enzymes: Hdac9, Hdacll. 
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tendency to be increased in Sox6 KO muscle, even 
though the difference was not statistically significant 
(Figure 5). These results indicate that Sox6 functions as 
a suppressor for these genes in developing muscle. 

Next, we assessed mRNA expression of fifteen genes 
out of the eighteen tested above (Myl4, Nfatc3, and 
Hdacll were excluded) as well as fast fiber specific 
genes, myogenic regulatory factors, and metabolism 
related genes in adult Sox6 KO muscle (Table 2). The 
slow fiber specific sarcomeric protein genes which had 
shown increased expression in Sox6 KO newborn mus- 
cle (Myh7, My 12, Tnncl, Tnnil, and Tnntl) displayed an 
even greater fold increase in mRNA expression in adult 
Sox6 KO muscles compared to control (Table 2). 
Among the four muscle groups tested, the TA and EDL 
Sox6 KO muscles showed the most dramatic increase in 
slow fiber specific gene expression, the soleus exhibiting 
the least fold increase (Table 2), again likely reflecting 
the lower Sox6 expression in the soleus than the fast 
muscles TA, EDL, and gastrocnemius (Additional file 1, 
Figure SI A). The fast fiber specific genes (Myh4, Tnni2, 
and Tnnt3) exhibited a significant decrease in their 
mRNA expression in Sox6 KO muscles (Table 2). Myhl 
(Ilx/d) and Myh2 (Ha), were either increased or 
decreased in different Sox6 KO muscle groups (Table 
2), which may reflect the fluid nature of MyHCs Ha and 
Ilx/d's expression in adult skeletal muscle [2]. These 
two MyHC isoforms are intermediates between MyHC- 
P and MyHC-IIb when fiber type shift occurs in skeletal 
muscle. Therefore, they could be more sensitive to the 
timing and level of the Sox6 gene inactivation, leading 
to varied expression in the individual Sox6 KO muscles. 
Upregulation of the cardiac isoform genes, Myh6 and 
Tnnt2, was also observed in the adult Sox6 KO muscle 
(Table 2). 

The significant upregulation in the slow fiber and cardiac 
isoform gene expression in adult Sox6 KO skeletal muscle 
likely suggests that inactivation of the Sox6 gene early in 
myogenic development inhibited the postnatal maturation 
of the skeletal muscle. Postnatal development of skeletal 
muscle is characterized by the progressive decline of slow 
fiber specific gene expression in fast muscles [6,7,55]. As a 
result, control EDL and TA muscles express only a trace 
amount of the MyHC-P protein [6,56,57]. The extreme 
upregulation of the slow fiber specific genes such as Myh7, 
Tnncl, and Tnntl in the Sox6 KO fast muscles may reflect 
their suspended postnatal maturation. This delayed 
maturation hypothesis is supported by the observation that 
the embryonic isoform acetylcholine receptor (Ach-R) y 
(Chrng) is expressed at a higher level than the adult iso- 
form Ach-R s (Chrne) in Sox6 KO muscles (Table 2). Dur- 
ing postnatal maturation of skeletal muscle, Ach-R y is 
replaced by the adult isoform Ach-R s [58]. In the adult 
Sox6 KO muscles, silencing of Chrng was not seen and 



Chrne expression did not reach to the control level (Table 
2). Since we have located one Sox6 peak in the Chrng pro- 
moter region (approximately 135 bp upstream of the TSS), 
Sox6 may be directly suppressing transcription of Chrng 
during normal skeletal muscle development. 

Transcriptional regulatory genes associated with Sox6 
peaks are upregulated in Sox6 KO adult muscle 

In addition to the sarcomeric protein genes, mRNA 
levels of some of the transcriptional regulatory genes 
associated with Sox6 peaks were upregulated in the 
Sox6 adult KO skeletal muscles. Proxl expression was 
significantly increased in Sox6 KO muscles, with the 
highest fold increase in the TA and EDL, followed by 
the gastrocnemius (Table 2). It should be noted that 
Proxl expression is highest in the soleus in the adult 
control (Sox6 f/f ) muscles (Additional file 1, Figure SIB). 
These observations suggest that Proxl may play a role 
for sustaining slow fiber specific gene expression in 
adult muscle. Tead4 and Hdac9 also showed a slight 
increase in their expression in the Sox6 KO TA, EDL, 
and gastrocnemius muscles (Table 2). Expression of 
Tcf4 and Teadl, on the other hand, showed no clear dif- 
ference between Sox6 KO and control adult muscles 
(Table 2), in spite of their higher expression in the Sox6 
KO newborn muscle (Figure 5). This result suggests that 
Sox6 may regulate transcription of Tcf4 and Teadl in 
developing muscle, but this regulation may not be main- 
tained through adult. 

Since it has previously been reported that MyoD and 
Myogenin are differentially expressed between slow and 
fast muscles (MyoD higher in fast than slow; Myogenin 
higher in slow than fast) [59,60], we also examined 
mRNA expression of these genes in Sox6 KO muscle. 
As shown in Table 2, there was no discernable change 
in MyoD mRNA expression, whereas there was a small 
increase in Myogenin mRNA expression in Sox6 KO 
muscles. An increase in Myogenin expression in Sox6 
KO muscle suggests that Myogenin may play some role 
in maintaining slow fiber phenotype in the adult skeletal 
muscle as previously proposed [60]. 

The level of transcriptional upregulation of metabolism 
related genes is less than that of slow fiber sarcomere 
protein genes in Sox6 KO muscle 

Since a close coupling between the slow fiber gene pro- 
gram and the oxidative metabolism gene program in 
adult skeletal muscle has been reported [1,3], we also 
examined mRNA expression of the genes whose high 
expression is correlated with the oxidative state of skele- 
tal muscle metabolism in Sox6 KO muscle (myoglobin, 
Sdha and Ppargcla). In MCK-Cre induced Sox6 KO 
muscle, mRNA levels of Ppargcla and Sdha were, in 
general, lower than control (Table 2). These results 
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replicated the data obtained using Myf5-Cre induced 
Sox6 KO muscle (Table 1). Myoglobin expression 
showed a slight increase in the Sox6 KO TA, EDL, and 
gastrocnemius (Table 2). When the color of gastrocne- 
mius and soleus muscles was visually inspected, the 
characteristic color difference between the two muscles 
in control muscle (soleus being redder than gastrocne- 
mius) was less clear in Sox6 KO muscle, because the 
Sox6 KO gastrocnemius exhibited an increase in redness 
in its color (Figure 6). This may reflect a small, but con- 
sistent increase in myoglobin expression in the Sox6 KO 
gastrocnemius muscle (Table 2). The more red muscle 
in Sox6 KO muscle has been also reported by Quiat et 
aL [34]. Interestingly, both Quiat et al. and our current 
report observed reduced expression in Ppargcla in Sox6 
KO muscle, which may suggest that there could be a 
pathway independent of Ppargcla regulating myoglobin 
expression in Sox6 KO muscle. An alternative explana- 
tion for the increased redness in the Sox6 KO muscle 
could be a change in capillary density. In the Sox6 peak 
associated RefSeq genes, GO terms related to vascula- 
ture development and angiogenesis were also enriched 
CP < 2 x 10" 3 ) (Additional file 5, Table S2). Thus, the 
increased capillary density could be the cause of more 
red color of the Sox6 KO gastrocnemius. 

Fetal isoform gene expression is upregulated in the Sox6 
KO heart 

Since the two cardiac MyHC isoform genes, a and P 
(Myh6 and Myh7), were associated with Sox6 binding 
(Additional file 3, Figure S2B) and their expression was 
upregulated in Sox6 KO skeletal muscle (Figure 5, Table 
2), we next examined their expression in the Sox6 KO 
heart. In the mouse heart, expression of MyHC-a and 
MyHC-P is developmentally regulated. MyHC-p is the 
fetal isoform in the heart and is replaced by the adult 




Control Sox6 KO 



(Sox6 f/f ) (Sox6 f/f ; MCK-Cre) 

Figure 6 Morphological difference of skeletal muscle between 
control and Sox6 KO mice. Dissected gastrocnemius/soleus 
muscles from three month old mice are shown. A. control (Sox6 f/f ) 
muscle. B. Sox6 KO (Sox6 f/f ; MCK-Cre) muscle. 



isoform MyHC-a within the first week after birth [61]. 
As summarized in Table 4 it appears that this isoform 
transition, fetal to adult, is incomplete in the Sox6 KO 
myocardium. In the Sox6 KO heart, MyHC-P expression 
was sustained at an equal to a slightly higher level than 
control heart, whereas MyHC-a expression decreased to 
approximately the half of the control level (Table 4). To 
test if this is a developmental defect in the postnatal 
heart, we also examined expression of the developmen- 
tally regulated skeletal a-actin gene, which is expressed 
in the fetal heart and silenced later in adult [62]. Indeed, 
skeletal a-actin mRNA expression was consistently 
higher in the Sox6 KO heart (Table 4), suggesting that 
the Sox6 KO heart is developmentally more immature 
than the control heart. Interestingly, the expression of 
Ppargcl was also lower in the Sox6 KO heart (Table 4). 
Since Ppargcl plays an important role in maturation of 
the metabolic state and mitochondrial biogenesis in the 
postnatal heart [63-65], this result suggests that the loss 
of Sox6 caused a delay in the postnatal maturation of 
the heart, thus Sox6 may also be necessary for the func- 
tional maturation of cardiac muscle. 

Nfatc3 protein expression is highly upregulated in Sox6 
null myotube cultures 

It has been reported that Nfatc3 stimulates myogenic 
differentiation both in vivo and in vitro [43,45]; how- 
ever, its implication in muscle fiber type specification 
has not been noted. Calcineurin-directed dephosphoryla- 
tion of NFAT factors results in their nuclear localization 
and transcriptional activation of their target genes [66]. 
We have located one Sox6 peak in the last intron of 
Nfatc3 (Additional file 3, Figure S2J). As shown in Fig- 
ure 5, Sox6 KO newborn skeletal muscle showed a 
small increase (not statistically significant) in Nfatc3 
mRNA expression. To assess whether the Nfatc3 activity 
increases in Sox6 null myotubes, we examined sub-cel- 
lular localization the Nfatc3 protein using Western blot. 
We took advantage of the Sox6 null mouse (p 100H -Sox6 
null mutant allele) in our laboratory to obtain a pure 
population of Sox6 null myotubes [12,13,26]. Fetal 



Table 4 Fold change in mRNA levels in the Sox6 KO heart 
compared to control 







Heart 




Mouse ID# 


1 


2 


3 


Sox6 


0.23 


0.16 


0.30 


Myh6 


0.67 


0.61 


0.51 


Myh7 


3.40 


1.78 


1.25 


Actal (sk-actin) 


4.28 


4.36 


2.55 


Ppargcla 


0.52 


0.57 


0.51 



Sox6 was inactivated using MCK-Cre mice. Mouse 1 and mouse 2 are two 
month-old, and mouse 3 is three month-old. Control expression level = 1.00. 
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myoblasts were prepared from E18.5 p 100H -Sox6 null 
and wild type littermates and were differentiated in dif- 
ferentiation medium (DM). In undifferentiated myoblast 
cultures, the amount of the nuclear as well as cytoplas- 
mic Nfatc3 protein was comparable between Sox6 null 
and wild type (Figure 7). Once myotube differentiation 
was induced, in wild type cultures, the Nfatc3 protein 
was detected only in the nuclear fraction, whereas in the 
piooH cu ij- ureSj a continuous presence of the cytoplasmic 
Nfatc3 protein and a higher level of the nuclear Nfatc3 
protein (compared to wild type) were observed (Figure 
7). We have previously reported that Sox6 expression is 
significantly increased upon induction of myotube differ- 
entiation [13]. Therefore, these results suggest that a 
higher level of Sox6 expression in wild type myotubes 
likely suppressed new synthesis of Nfatc3, while the 
absence of Sox6 in p 100H myotubes allowed continuous 
Nfatc3 synthesis. These results suggest that Nfatc3 activ- 
ity is upregulated in the Sox6 null myotubes which show 
a higher level of slow fiber specific gene expression. 

Functional analysis of the Sox6 binding sites 

In order to characterize the functional nature of the 
Sox6 binding sites in transcriptional regulation, we next 
performed reporter gene assays. We chose five Sox6 
peak- associated genes, Myh7 (MyHC-P), Myh7b, Tnncl, 
Tnniland Hdacll, in which Sox6 binding was validated 
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Figure 7 Nfatc3 protein levels are increased in both the 
nucleus and cytoplasm of Sox6-null myotubes. Wild type (WT) 
and Sox6-null {p 100H ) fetal primary myoblasts or myotubes were 
harvested at 0, 24, 48, 72, and 96 h after switching to DM, and 
nuclear and cytoplasmic protein was extracted for Nfatc3 Western 
blotting. TATA binding protein (TBP) and tubulin were used as 
loading controls for nuclear and cytoplasmic fractions, respectively. 
Upper panels: nuclear fractions. Lower panels: cytoplasmic fractions. 



by ChlP-qPCR (Additional file 4, Figure S3). All of these 
Sox6 peaks tested contained a Sox consensus motif. 
Firefly luciferase vectors containing each of the follow- 
ing sequences, -3.5 kb Myhl S'-upstream sequence (two 
Sox6 peaks; MHC3500), ~6 kb Myh7b 5'-upstearm 
sequence (one Sox6 peak), -1.3 kb of the Tnncl first 
intron (one Sox6 peak), -5.2 kb Tnnil S'-upstream 
region (two Sox6 peaks), and ~1 kb Hdacll S'-upstream 
sequence (one Sox6 peak) were generated (Figure 8A; 
see Additional file 3, Figure S2B-E and S2M for the 
location of Sox6 peaks). It should be noted that the 
proximal Sox6 peak in the Tnnil 5'-upstream region 
(approximately -800 bp from TSS) overlapped with the 
previously reported slow upstream regulatory element 
(SURE) containing an enhancer element [67]. 

To assess whether these Sox6 binding sites function as 
a negative or positive regulatory element, the luciferase 
reporter gene constructs described above were transi- 
ently transfected to p 100H -Sox6 null and wild type myo- 
blasts, differentiated in DM for 48 hours, after which 
firefly luciferase activities were compared between the 
Sox6 null and wild type myotube cultures. If these Sox6 
binding sequences function as negative regulatory 
regions, it is expected that the luciferase activity would 
be higher in p 100H myotube cultures in which no func- 
tional Sox6 protein is produced. As summarized in Fig- 
ure 8B, four out of the five sequences tested drove a 
higher firefly luciferase activity in Sox6 null myotube 
cultures compared to wild type, indicating that these 
Sox6 binding sites function as negative regulatory 
sequences. The Myh7b 5'-sequence did not drive a sta- 
tistically higher luciferase activity in Sox6 null myotube 
cultures (Figure 8B). Since the endogenous Myh7b 
expression was higher in Sox6 KO muscle (Figure 5), it 
is possible that the in vitro culture may not be the best 
approach to assess the effect of the Myh7b Sox6 biding 
regions. Intriguingly, Bell et al. have shown that Sox6 
protein overexpression in C2C12 cells could suppress 
transcription from the 1 kb Myh7b 5'-upstream 
sequence [68]. Therefore, there could be a Sox6 binding 
site not detected in our ChlP-seq analysis which may 
still be functioning as a negative regulatory element in a 
different context. 

We have previously shown that the proximal Sox6 
binding site (-200 bp from the Myh7 TSS) functions as 
a negative regulatory element in reporter gene assays 
[13]. In the present report, we have identified an addi- 
tional distal Sox6 binding site (-2900 bp from TSS) 
which overlaps with a known muscle enhancer element 
[69,70]. To delineate the two Sox6 binding sites in the 
Myh7 5'-upstrem region (see Additional file 3, Figure 
S2B for the peak locations), the distal Sox consensus 
sequence (-2,900 bp) was mutated in MHCP3500 (desig- 
nated as MHC P3500 m) (Figure 8A). As shown in 
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Figure 8 Differences of reporter activities between wild type and Sox6-null primary myotubes. A. Schematic representations of the firefly 
luciferase vectors constructed to test the function of Sox6 binding sequences. Black boxes indicate the firefly luciferase gene and shaded boxes 
indicate the chicken p-actin promoter (Actb-p). Open boxes indicate the upstream sequences of the wild type MyHC-(3 gene (MHC|33500) or its 
mutated version (MHQ33500 m; cross indicates mutation), Myh7b, Tnnil, and Hdocll, or the first intron sequence of Tnnd. Approximate 
positions of Sox6 binding sites are indicated by gray lines. B. The reporter constructs shown in A were cotransfected with a Renilla luciferase 
vector into wild type (WT) and Sox6-null {p 100H ) primary myoblasts. After differentiation into myotubes, both luciferase activities were measured 
and normalized with Renilla luciferase activity. Data were further normalized to WT MHC|33500 value (i.e. MHC|33500 in WT = 1.0), and 
represented as mean ± SD (n = 3). (*) P < 0.05; (**) P < 0.005. 



Figure 8B, the loss of the distal Sox motif did not affect 
the luciferase activity in either Sox6 null or wild type 
myotubes. This result suggests that the distal Sox motif 
has little effect on transcriptional suppression from the 
3.5 kb Myh7 5'-region in transient assays, and therefore, 
at least in the current in vitro assay conditions, the 
proximal Sox6 binding site is sufficient to suppress the 
transcription driven by the 3.5 kb Myh7 5'-upstream 
region. 

The function of the Sox6 binding site in the Tnncl 
first intron was determined using a hybrid luciferase 
reporter construct whose transcription is driven by the 
chicken P-actin promoter. The Tnncl first intron con- 
tains an enhancer element which was previously identi- 
fied using C2C12 and Sol8 skeletal muscle cell lines 
[41]. The presence of this intron alone significantly 
increased luciferase activity in wild type myotubes 
(Actb-p vs. Actb-p+Tnncl, p < 0.0001), confirming the 
enhancer activity (Figure 8B). The luciferase activity of 
the construct, Actb-p+Tnncl, in Sox6 null myotubes 
was significantly higher than wild type, indicating that 
Sox6 binding hindered the enhancer activity in this 
intron (Figure 8B). Unexpectedly, the construct contain- 
ing only the chicken P-actin promoter exhibited a small 
but statistically significant increase in luciferase activity 
in Sox6 null myotube cultures compared to wild type 
(Figure 8B). This was likely caused by the fortuitous pre- 
sence of a couple of Sox motif sequences in the chicken 
P-actin promoter and intron sequences in the vector 
(data not shown), which could have functioned as a 
weak silencer element. The 5'-upstream sequences of 



both Tnnil and Hdacll showed a moderate but statisti- 
cally significant increase in luciferase activity in Sox6 
null myotubes (Figure 8). 

Discussion 

In order to understand how Sox6 regulates muscle dif- 
ferentiation at the molecular level, we have performed 
ChlP-seq analysis to identify Sox6 targets in skeletal 
myotubes and extended the characterization of the Sox6 
null muscle phenotype using muscle specific Sox6 inac- 
tivation. Among the 867 Refseq genes found to be asso- 
ciated with Sox6 peaks, the overrepresented GO terms 
included muscle structure and function, skeletal muscle 
and heart development, as well as transcriptional regula- 
tion. In a concurrently conducted Pol II ChlP-seq analy- 
sis, we found that the majority of the Sox6 peak- 
associated genes exhibited little to no recognizable bind- 
ing peaks, suggesting that Sox6 mainly functions as a 
transcription suppressor in developing muscle. 

How does Sox6 suppress its target genes? Based on 
evidence from this and other labs, we can speculate on 
two possible mechanisms (1 and 2) and, based on evi- 
dence accumulated in this report we also demonstrate 
two other likely mechanisms (3 and 4): (1) Sox6 may 
fine-tune the transcription of the genes that have been 
marked by MyoD binding, (2) Sox6 may modulate tran- 
scription of its target genes in concert with Tead and 
Runx factors, (3) Sox6 suppresses transcription by hin- 
dering the muscle-specific enhancer activity, and (4) 
Sox6 also indirectly influences downstream gene expres- 
sion by regulating the expression of other transcription 
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factors and chromatin modifying enzymes. Below, we 
will discuss each of these proposed mechanisms in more 
detail 

MyoD is one of the myogenic regulatory factors and 
defines the myogenic lineage during development 
[71,72], In myotubes, MyoD binding events are frequent 
(-26,000 peaks with a higher cut off, -60,000 peaks 
with a lower cut off) and are associated with histone H4 
acetylation (H4Ac) [36], which is a marker of an active 
chromatin state [73]. We found that 96% of the Sox6 
peaks in fetal myotubes overlapped with, or were in the 
close vicinity to (within 50 bp), the reported MyoD 
peaks [34]. The E-box motifs in the Sox6 peak regions 
we found were enriched for the CAGCTG E-box 
sequence (Figure 3). Previously, it has been shown that 
this motif is represented in the peaks more strongly 
bound in C2C12 myotubes compared to myoblasts, indi- 
cating that this E-box motif is mainly associated with 
the genes regulating muscle differentiation [36]. Taking 
this observation together with ours, we speculate that 
MyoD binding in the myotube would change the chro- 
matin environment in such a way as to allow the 
approach of additional transcriptional regulators by 
recruiting the chromatin modifying enzymes [74], thus 
allowing the fine-tuning of muscle specific gene expres- 
sion necessary for the formation of mature skeletal mus- 
cle. Sox6 could be one of these additional 
transcriptional regulators and specify fiber type charac- 
teristics during muscle terminal differentiation. 

We have previously reported that Sox6 interferes with 
a MCAT enhancer located in close proximity to the Sox 
consensus motif in Myh7, causing suppression of Myh7 
transcription [13]. Tead/MCAT motifs are frequently 
found in enhancer or promoter regions of muscle speci- 
fic genes and it has been demonstrated that binding of 
TEF-1/Teadl to the MCAT motifs activates transcrip- 
tion of these muscle-specific genes [38,75]. In our analy- 
sis of the 1,066 Sox6 peaks, we found 203 MCAT 
motifs. This suggests that the mechanism of Myh7 tran- 
scriptional suppression by Sox6 (possibly via physical 
interference) we reported earlier may be a common 
mechanism Sox6 uses to suppress genes whose tran- 
scription is activated via Tead/MCAT motifs. Our analy- 
sis also revealed 559 Runx motifs in the 1,066 Sox6 
peaks. Currently, the roles of Runx motif binding factors 
(Runx-1, -2, and 3) in muscle development are not well 
known, though there are reports showing that Runxl 
plays a role in skeletal muscle differentiation [37,76,77]. 
In adult skeletal muscle, Runxl expression is induced by 
denervation [77], and muscle-specific Runxl inactivation 
leads to accelerated muscle wasting in denervated mus- 
cle [76]. In an earlier stage of muscle differentiation, it 
has been reported that Runxl directly interacts with 
MyoD preferentially in proliferating myoblasts to inhibit 



terminal differentiation of skeletal muscle [37]. The 
authors showed that the Runxl/CBFp complex recruits 
suppressive chromatin modifying enzymes (e.g. HDACs), 
thus inactivating transcription of the MyoD target genes 
that are necessary for the cell cycle exit and differentia- 
tion [37]. Since the Runx proteins have been shown to 
function as transcriptional suppressors or activators in 
different circumstances [78] (similar to Sox6), the tran- 
scriptional outcome of the possible interaction between 
the Sox6 and Runx proteins needs further investigation. 

As demonstrated in the Results section, the Sox6 
binding sites in the Tnncl first intron and the Tnnil 5'- 
upstream region both effectively reduced the activity of 
the enhancer elements (Figure 8). The molecular 
mechanisms by which Sox6 overrides muscle enhancers 
is currently under investigation; however, the skeletal 
muscle MyHC gene clusters may help shed light on this 
role of Sox6. In the six MyHC isoform genes clustered 
on the mouse chromosome 11 [Myh3 (emb), Myh2 (Ha), 
Myhl (Ilx/d), Myh4 (lib), Myh8 (peri), Myhl3 (eo)] [74], 
only the Myh4 and Myh8 genes were not associated 
with Sox6 peaks (a Sox6 peak was detected in the 5'- 
upstream region of Myhl3 in one of the two ChlP-seq 
data sets; data not shown). Therefore, Sox6 may be 
involved in sequential expression of the MyHC loci, pos- 
sibly in collaboration with an enhancer element similar 
to the locus control region (LCR) reported for the glo- 
bin gene cluster [79]. This is an appealing hypothesis, 
because it has been shown that Sox6 (acting as a tran- 
scriptional suppressor) regulates sequential expression of 
the p-globin genes during erythrogenesis [80] in concert 
with BCL11A which binds to the globin gene LCR [81]. 
There have been reports on transcription factories that 
unite transcriptionally active genes on separate chromo- 
some regions for coordinated transcription [82]. It is 
possible that association of Sox6 with its target 
sequences inhibits transcriptional initiation by Pol II, 
thus causing dissociation of Sox6 target genes from 
transcription factories. 

We demonstrated that expression of Teadl, Tead4, 
Hdac9, and Proxl was upregulated in Sox6 KO skeletal 
muscle (Figure 5), suggesting that Sox6 is a suppressor 
of these transcriptional regulatory genes. Teadl (TEF-1) 
and Tead4 (RTEF-1) are highly expressed in muscle tis- 
sues and have been reported to activate muscle specific 
gene transcription [83-85]. Hdac9 is a class Ha HDAC 
[86] and functions as a mediator of motor neuron input 
to skeletal muscle [87]. Proxl is expressed in slow mus- 
cle in zebrafish [47]. Since Proxl is preferentially 
expressed in slow fiber muscle in control mice (Addi- 
tional file 1, Figure SIB) and Sox6 inactivation caused a 
sizable increase in Proxl mRNA expression in Sox6 KO 
muscle, we propose that Proxl also plays a role in regu- 
lation of slow muscle fiber specific gene expression in 
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mice. This observation presents further evidence of evo- 
lutionary conservation in the mechanisms regulating 
muscle fiber type differentiation in vertebrates [19,88]. 
Since there are more transcriptional regulator genes that 
are closely associated with Sox6 peaks, which we did 
not have space to discuss in this report, it is likely that 
Sox6 is part of the transcriptional networks that shape 
the characteristics of both muscle development and 
mature muscle functions. 

The most striking phenotype of Sox6 null skeletal 
muscle is the dramatic increase in the expression of 
multiple slow fiber specific genes. This observation ori- 
ginally led us to hypothesize that Sox6 functions as a 
transcriptional suppressor of slow fiber specific genes 
[12,13]. In this report, we expanded the gene expression 
profiling of Sox6 KO skeletal muscle by including car- 
diac and embryonic muscle isoform genes. Cardiac iso- 
forms Myh6 and Tnnt2, as well as embryonic isoforms 
Myl4 and Chrng, were upregulated in the Sox6 KO mus- 
cle (Figure 5, Table 2). It has been reported that Tnnt2 
is upregulated in regenerating dystrophic muscle [89]. 
Myh6 is expressed in specialized craniofacial muscle, 
such as jaw and extraocular muscle, but not in limb or 
other body muscle [90,91]. These observations suggest 
that Sox6 may play a role in not only determining fiber 
types, but also defining developmental maturity and 
highly specialized functions of skeletal muscle. 

In Sox6 KO muscle, a significant decrease in fast fiber 
specific gene expression was also observed. This Sox6 
KO phenotype could be a secondary effect of the 
increased slow fiber gene products, or could be regu- 
lated indirectly by Sox6. Since we did not find Sox6 
peaks associated with fast fiber specific genes, both 
mechanisms are equally plausible. With regard to indir- 
ect regulation, a few possible mechanisms can be 
hypothesized. For example, expression of the transcrip- 
tion factors Sixl and Six4, activators of fast fiber specific 
gene expression [29,92], could be indirectly suppressed 
in Sox6 KO muscle during development. Alternatively, 
downregulation of fast fiber specific genes in Sox6 KO 
muscle could be caused by changes in microRNA 
expression. MicroRNAs are known to function as post- 
transcriptional regulators of gene expression [93]. A 
recent report indicates that microRNAs suppress target 
gene expression predominantly through mRNA degrada- 
tion [94], thus, it is plausible to postulate that an 
increase in microRNAs targeting fast fiber specific genes 
in Sox6 KO muscle leads to reduced fast fiber specific 
gene mRNA levels. As described above, we found Sox6 
binding peaks associated with Myh6 and Myh7 (Addi- 
tional file 3, Figure S2B). In the intron sequences of 
Myh6 and Myh7, miR-208a and miR-208b are encoded, 
respectively [95]. It has been reported that miR-208 sup- 
presses expression of THRAP1, which promotes fast 



fiber specific gene expression [96]. The increased tran- 
scription of Myh6 and Myh7 in Sox6 KO muscle, there- 
fore, could lead to upregulation of miR-208, which in 
turn, suppress fast fiber specific gene expression. How- 
ever, the actual situation is likely to be more complex. It 
should be noted that miR-208, along with miR-499, also 
targets the 3'-UTR region of Sox6 [68,97,98]. MiR-499 is 
encoded in the intron of Myh7b [95,99], which has a 
Sox6 binding site in its 5'-upstream region (Additional 
file 3, Figure S2C). Since Myh6, Myh7, and Myh7b are 
all negatively regulated by Sox6 (Figure 5), these data 
suggest that Sox6 and these miRNAs constitute two-way 
feedback loops. 

Figure 9 summarizes both our current results and the 
reported regulatory mechanisms for Sox6 expression. A 
recent report on the regulation of Sox6 expression in 
zebrafish skeletal muscle has demonstrated that Sox6 
transcription is positively regulated by MyoD and Myf5, 
and repression of Sox6 activity in slow fibers is main- 
tained by miR-499 which targets the Sox6 3'-UTR [100]. 
We have reported that Sox6 transcription is upregulated 
when myotube differentiation is induced [13], therefore, 
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Figure 9 Summary of the present work concerning the fiber 
type specification under the control of Sox6. It has been shown 
that in zebrafish, MyoD and Myf5 are necessary to activate Sox6 
gene expression in muscle [100]. This muscle-specific Sox6 
activation mechanism has not been tested in mice yet, but since 
Sox6 upregulation coincides with upregulation of these myogenic 
regulatory factors during muscle differentiation, it is very likely that 
this mechanism is shared in mice also (see text). Once expressed in 
muscle, Sox6 directly suppresses transcription of slow fiber specific, 
cardiac and embryonic isoform genes during muscle development. 
In addition to these structural protein genes, Sox6 suppresses 
expression of the transcription factors which have been shown to 
activate slow fiber specific genes, Teadl, Tead4, and Proxl. By 
unknown mechanisms, fast fiber-specific gene expression in Sox6 
KO skeletal muscle is dramatically reduced. Since Sox6 is 
preferentially expressed in fast-twitch fiber rich muscles, it is 
possible that Sox6 indirectly stimulates the fast fiber-specific gene 
program. This idea awaits future investigation. Sox6 activity in slow 
fibers is suppressed by miR-499 which is encoded in an intron of 
the Myh7b gene. We have shown that Sox6, in turn, suppresses 
Myh7b transcription. This negative feedback loop might be 
important for fiber type switching during muscle development as 
well as in adult muscle. 
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MyoD and Myf5 might also be activating Sox6 tran- 
scription during mammalian muscle development. Since 
MyoD is preferentially expressed in fast fibers in adult 
mice [59,60], it may sustain the higher level of Sox6 
expression in adult fast fiber muscles reported here as 
well as by Quiat et al [34]. Although the negative regu- 
lation of Sox6 by miR-499 has been already reported in 
mice [68,97,98], how suppression of Sox6 expression in 
slow fibers is initiated is not yet understood. Alterna- 
tively, it is also possible that Sox6 expression is activated 
when fast-twitch myotubes emerge during fetal muscle 
development [101]. Since fiber type-specific gene expres- 
sion in mammalian skeletal muscle during development 
as well as in adult life is very fluid [2-4], how Sox6 
expression is regulated will be an increasingly important 
question as we try to understand how muscle fiber type 
is initially specified, maintained and changed in rese- 
ponse to the external signaling. 

Conclusions 

We have shown that: (1) Sox6 directly suppresses the 
transcription of slow fiber-specific, cardiac, and embryo- 
nic isoform genes through binding to the transcriptional 
regulatory regions, (2) Sox6 regulates expression of tran- 
scriptional regulators critical for muscle development, 
therefore, extending its effect on muscle development by 
cross-talking with other regulatory pathways, (3) Loss of 
Sox6 in skeletal muscle results in a significant increase 
in expression of slow fiber-specific, cardiac, and embryo- 
nic isoform genes which are associated with Sox6 bind- 
ing peaks, accompanied by a decreased in fast fiber- 
specific gene expression, and (4) Loss of Sox6 in cardiac 
muscle results in increased expression of fetal isoform 
genes in the adult heart, which suggests that Sox6 is 
required for the postnatal maturation of cardiac muscle 
as well. Since the Sox6 KO phenotypes reported here 
have relevance to muscle degenerative diseases 
[102-105] as well as heart failure [106], uncovering the 
many functions of Sox6 in muscle development will 
likely contribute to the understanding of mechanisms of 
human muscular diseases. 

Methods 

Cell culture 

Isolation, culture, and induction of myotube differentia- 
tion in differentiation medium (DM) of fetal myoblasts 
isolated from mouse E18.5 limb were described pre- 
viously [13]. 

ChlP-seq 

ChIP experiments were performed using the Imprint 
Chromatin Immunoprecipitation Kit (Sigma-Aldrich) 
according to the manufacturer's instructions. Primary 
myotubes differentiated in DM for 48 h were washed 



with phosphate-buffered saline with 1 mM MgCl 2 (PBS- 
Mg) once, and then fixed with 2 mM disuccinimidyl 
glutarate (DSG; Thermo Fisher Scientific) for 45 min at 
room temperature as described by [107]. Cells were 
washed with PBS-Mg twice, and fixed further using 1% 
formaldehyde for 10 min at room temperature. Antibo- 
dies used were rabbit polyclonal antibody to Sox6 
(ab30455, Abeam), mouse monoclonal antibody to RNA 
Polymerase II (Pol II) CTD repeat YSPTSPS (4H8) 
(ab5408, Abeam), and normal mouse IgG from the ChIP 
kit. ChlP-seq library was prepared as described pre- 
viously [108]. Briefly, the immunoprecipitated material 
was end-repaired, A-tailed, ligated to the sequencing 
adapters, amplified by 18-cycles of PCR and size selected 
(300-600 bp) followed by single end sequencing on an 
Illumina Genome Analyzer II by the DNA Technologies 
Core Facility at University of California, Davis http:// 
dnatech.genomecenter.ucdavis.edu/. ChlP-seq data are 
available at Gene Expression Omnibus (accession num- 
ber GSE32627). 

ChlP-seq data analysis 

ChlP-seq reads and input sample reads were aligned 
using Bowtie (version 0.12.5) [109] with default para- 
meters to the mouse NCBI Build 37 genome assembly. 
We obtained 3 and 1.5 million uniquely mapped reads 
from two independent Sox6 ChIP experiments, and 1.5 
and 2.8 million uniquely mapped reads from two inde- 
pendent Pol II ChIP experiments, respectively. Peak call- 
ing was performed using SISSRs (version 1.4) [110] with 
the following parameters, -s (genome size): 
2,716,965,481 bp, -F (average length of DNA fragments): 
450 bp, -b (background file): input DNA data file of 
each ChIP experiment. Common peaks from the two 
Sox6 data sets were identified using ChlP-Seq Tool Set 
(version 1.0) [111]. Corresponding peaks within 50 bp 
were considered as overlapping. Peak annotation was 
carried out using Peak Analyzer [112] with the default 
mouse mm9 annotation file. ChlP-seq data was visua- 
lized on the UCSC Genome Browser [113]. Motif dis- 
covery was conducted using MEME (version 4.5.0) [35] 
with default parameters, followed by comparison against 
three motif databases (JASPAR, TRANSFAC, and UNIP- 
ROBE) using TOMTOM (version 4.5.0) [114]. Gene 
ontology (GO) analysis was performed using the Data- 
base for Annotation, Visualization and Integrated Dis- 
covery (DAVID; http://david.abcc.ncifcrf.gov/) [115,116]. 
Pol II binding was represented as RPKM (reads per kilo- 
base of RefSeq gene region per million mapped reads) 
based on read (tag) numbers in peak regions. We used 
the 2.8 million read data for calculation and visualiza- 
tion to maximize accuracy of result. When there are 
multiple RefSeq gene models per gene, the longest gene 
model was used for the calculations. When Pol II peaks 
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were extended to extragenic regions, the length of 
extended regions was added to that of RefSeq gene 
models. 

qPCR 

All measurements were conducted with ABI Prism 7900 
HT Sequence Detection System (Applied Biosystems). 
ChlP-qPCR was performed using Maxima SYBR Green/ 
ROX qPCR Master Mix (2X) (Fermentas) and specific 
primers listed in Additional file 6, Table S3. Single pro- 
ducts were confirmed by dissociation curve analysis. 
Results were normalized to input, and fold enrichment 
was calculated by normalizing to enrichment at a nega- 
tive control region (intergenic). RT-qPCR was per- 
formed using TaqMan Gene Expression Assays (Applied 
Biosystems). Total RNA was extracted with TRIzol 
reagent (Invitrogen). Following DNase treatment with 
DNA-free Kit (Ambion), cDNA was synthesized using 
High Capacity cDNA Reverse Transcription Kits 
(Applied Biosystems) with SUPERase-In (Ambion). 

TaqMan probes used are provided in Additional file 7, 
Table S4. Results were normalized to P-actin (Actb) 
transcript level. All statistical analyses were performed 
using the two-tailed Student's t-tests. Relative mRNA 
levels against P-actin are shown in Additional files 8 and 
9, Figures S4 and S5. 

Western blotting 

Nuclear and cytoplasmic fractionation of primary myo- 
blasts and myotubes was carried out using NE-PER 
Nuclear and Cytoplasmic Extraction Reagents (Thermo 
Fisher Scientific). A total of 30 ug of each protein sam- 
ple was loaded on 7.5% SDS-polyacrylamide gel electro- 
phoresis and transferred to a nitrocellulose membrane. 
The blot was incubated with anti-Nfatc3 mouse mono- 
clonal antibody (sc-8405; Santa Cruz Biotechnology) at 
1:100, and the signal was detected by Pierce ECL Wes- 
tern Blotting Substrate (Thermo Fisher Scientific). To 
estimate the amount of protein loaded in each lane, the 
same blot was stripped and then incubated with anti- 
TATA binding protein (TBP) mouse monoclonal anti- 
body (ab818; Abeam) or anti a-Tubulin mouse mono- 
clonal antibody (sc-8035; Santa Cruz Biotechnology) at 
1:1000. 

Plasmid construction 

A firefly luciferase expression vector driven by the 
MyHC-(3 promoter (MHCP3500, which contains 3,500 
bp of the 5' upstream sequence of the rat MyHC-P 
gene) was kindly provided by Dr. Baldwin at University 
of California, Irvine [117,118]. Using this vector as a 
template, a Sox motif in the distal Sox6 binding region 
found by our ChlP-seq experiments (approx. -2.9 kb in 
mouse) was mutated (T ACA AAG to TCAGAAG) by an 



inverse PCR method [119] using KAPA HiFi HotStart 
DNA Polymerase (KAPA Biosystems) to generate 
MHCP3500 m ("m" stands for mutation). Firefly lucifer- 
ase expression vectors driven by the upstream regions of 
Myh7b, Tnnil, and Hdacll genes (~ 6.0 kb, -5.2 kb, 
and ~1.0 kb, respectively) were generated by inserting 
restriction enzyme-digested PCR products into appropri- 
ate restriction sites of pGL3-basic vector (Promega). A 
firefly luciferase expression vector driven by the first 
intron of Tnncl gene was constructed by replacing the 
CMV enhancer of pTriEx-1.1 vector (Novagen) with the 
first intron of Tnncl (-1.3 kb) followed by insertion of 
a restriction enzyme-digested PCR product of firefly 
luciferase gene from pGL3-basic vector into the appro- 
priate restriction sites. A Renilla luciferase expresion 
vector driven by CMV promoter (pcDNA-Rluc) was 
produced by inserting the Renilla luciferase gene, which 
was obtained by digesting pRL-TK vector (Promega) 
with Nhel and Xbal, into pcDNA3.1/Zeo( + ) vector 
(Invitrogen). 

Primers used were listed in Additional file 10, Table 
S5. 

Reporter assay 

The reporter plasmids (see above) were co-transfected 
with the Renilla luciferase vector (pcDNA-Rluc) into 
mouse fetal primary myoblasts using Lipofectamine 
2000 (Invitrogen) at 1:1.5 ratio of DNA and Lipofecta- 
mine. Twenty four hours after transfection, cells were 
washed with PBS once, and switched to DM. Cells were 
incubated further for 48 h, and firefly and Renilla luci- 
ferase activities were measured using Dual-Glo Lucifer- 
ase Assay System (Promega) and a luminometer 
(LumiCount; Packard) according to the manufacturer's 
instructions. Statistical analyses were performed using 
the two-tailed Student's t-tests. As a negative control, 
pGL3-basic was used. 

Immunohistochemistry 

Lower hindlimbs were collected from E18.5 embryos 
and one week old mice (P7), and TA and EDL mus- 
cles were collected from four month-old mice and 
embedded in Tissue Freezing Medium (Triangle Bio- 
medical Sciences) for cryostat sectioning. Sections (10 
um) were fixed in 4% paraformaldehyde and processed 
for immunohistochemistry. As primary antibodies, 
rabbit polyclonal Sox6 antibody (ab30455, Abeam) at 
400 fold dilution and mouse monoclonal MyHC-P 
antibody at 100 fold dilution (NOQ7.5.4D, Sigma- 
Aldrich) were used. As secondary antibodies, Alexa 
Fluor 555-conjugated goat anti-rabbit IgG (A-21429, 
Invitrogen) and Alexa Fluor 488-conjugated goat anti- 
mouse IgG (A-11059, Invitrogen) were used. DAPI 
staining was performed to visualize nucleus. Images 
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were obtained at the confocal microscope facility at 
the UC Davis Genome and Biomedical Sciences 
Facility. 

Animal experiments 

The animal studies were carried out under the guidance 
issued by the University of California, Davis. 

Additional material 



control (Sox6 ) and Sox6 knockout (KO, Sox6 ; MCK-Cre) mice were 
calculated using the formula 2 _ACt . Two 2 month-old mice (mouse ID# 1 
and 2) and one 3 old-month mouse (mouse ID# 3) were analyzed. A. 
Relative mRNA level of Sox6 in TA, EDL, gastrocnemius (Gas), soleus (Sol), 
and the heart. B. Relative mRNA level of contractile protein genes in TA, 
EDL, Gas, and Sol. C. Relative mRNA level of transcriptional regulatory 
genes in TA, EDL, Gas, and Sol. D. Relative mRNA level of metabolism 
related genes and acetylcholine receptor genes in TA, EDL, Gas, and Sol. 
E. Relative mRNA level of Myh6, Myh7, Actal, and Ppargcla in the heart. 
*: not determined. #: undetected. 

Additional file 10: Table S5 Primers used for plasmid construction. 
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